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COMMUTATORS, PARAPRODUCTS AND BMO IN 
NON-HOMOGENEOUS MARTINGALE SETTINGS 

SERGEI TREIL 



Abstract. In this paper we investigate the relations between (martingale) BMO spaces, 
paraproducts and commutators in non-homogeneous martingale settings. Some new, and 
one might add unexpected, results are obtained. Some alternative proof of known results 
are also presented. 
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Notation 
X real line K or its subinterval. 

C Lattice of intervals in X. 



1, characteristic function of the set /. 

E, averaging operator, E^/ = l^/l -1 fj f(x)dx. 

A , martingale difference operator, A, = ( Sjechiid(J) ^j ) ~~ ^ri nere child(J) de- 

notes the collection of the "children" of /. 
(/)j average value of the function /, (f)j = j-j f(x)dx := |/| _1 fj f(x)dx. 

1. Introduction and main objects 

This paper was started in attempt to understand the relations between (martingale) com- 
mutators, paraproducts and space BMO. Initial hope was to cover both one-parameter and 
multi-parameter cases, but it became clear pretty soon that in the general, non-homogeneous 
case, even one-parameter situation is far from well understood. 
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2 SERGEI TREIL 

While the results about .H^-BMO duality for general martingales are well known and can 
be considered classical, paraproducts and commutators were studied mostly for regular r-adic 
martingales. 

In this paper several new, and one might add unexpected, results are obtained for the 
non-homogeneous situation. Let me list some of them here; for the definitions and exact 
statements the reader should look in the sections that follow. 

• Despite what one might expect, the condition b G BMO is not necessary (although 
it is of course, sufficient) for the boundedness of the paraproduct tx^ in LP . This 
means, in particular, that unlike the homogeneous case it is impossible to characterize 
b € BMO via boundedness of commutators of the multiplication operator Mj, and 
martingale multipliers. 

The condition b € BMO however is necessary and sufficient for the boundedness 

(*) 
of the so-called extended paraproduct 71^ . 

• The necessary an sufficient condition for the LP boundedness of the paraproduct is, 
as one might expect, that it is enough to check the boundedness on the characteristic 
functions of intervals. This statement is well-known and now almost trivial for p = 2; 
the result for p 7^ 2 is new and its proof is rather complicated. 

Note, that this condition depends on p, unlike the condition b £ BMO, which 
guarantees the the boundedness of ir b * hi all L p , p € (1, 00). 

• The condition b € BMO is, as one might expect, sufficient for the LP boundedness 
(p € (l,oo)) of the commutator [JVf6,T] = MbT — TMj, of the multiplication operator 
Mb and a bounded martingale transform T. This condition (up to some technical 
details) is also necessary for the boundedness of the commutator, provided that the 
martingale transform T satisfies some "mixing property". 

This result generalizes the classical result of S. Janson [8], which gives the descrip- 
tion of BMO via commutators in the case of regular r-adic martingales. The "mixing 
properties" that the martingale transform should satisfy generalize (and in the case 
of regular r-adic lattice coincide with) the notion of the non-degenerate martingale 
transform, considered in [5]. 

The "mixing condition" introduced in this paper is necessarily more complicated 
than the non-degeneracy condition in [8]. This is mainly due to the fact that it 
includes a condition that was "hidden" (trivially satisfied) in the homogeneous case. 
An example, demonstrating that this "hidden" condition is essential is presented in 
the paper. 

• It is shown in this paper that in general non-homogeneous case the martingale dif- 
ference spaces Dj = A,L P do not form the so-called strong unconditional basis in 
L p , p/ 2 (more precisely, in the martingale Hardy space H p , which is, in general 
for p E (l,oo) a subspace of LP with an equivalent metric). Essentially that means 
that it is impossible to define an equivalent norm in H p using only the norms of 
martingale differences ||A ,/|| p . 

An equivalent statement is that, unlike the case p = 2, for p/ 2 there exists an 
unbounded in H p martingale transform T (see the definition in the subsections that 
follow) with uniformly bounded blocks T,. 

Few word about general setup used in the paper. We do not work here in the settings of 
martingale spaces, because we want to include the situation with infinite measure, like the 
standard dyadic lattice in W 1 . While getting results in the case of infinite measure from the 
corresponding result in the martingale case (the finite measure) is usually pretty easy, there 
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are some delicate situation, when one has to be careful stating the result. (Of course, usually 
after the results are stated, they are quites easy to prove, but stating the results require some 
attention). 

For example, while this is well known to specialists, it might be a surprise to a reader just 
casually acquainted with martingale Hardy spaces and BMO, that for the standard dyadic 
lattice in R (and in IR n ) one can find a function b in dyadic BMO such that the martingale 
difference decomposition ^7"ex>^/^ diverges a.e. I haven't seen this mentioned anywhere in 
the literature, probably everybody had to notice this fact for him/her-self. 

So, in this paper we work on the real line R, and our u-algebras are generated by disjoint 
intervals. While practically everything can be stated and proved in the setting of arbitrary 
measure space, we want to avoid non-essential technical details and concentrate on main 
ideas. For example, at some point we will be using Fefferman-Stein maximal theorem, which 
is stated and proved for M n but not for an arbitrary measure space. 

The settings on the real line covers the example we are mostly interested in: the case of R n 
with the standard dyadic lattice and with an arbitrary Radon measure fj,, where the averages 
are taken with respect to [i. Such situation is typical in the non-homogeneous harmonic 
analysis, cf [TO1 CDS [H] 

1.1. Lattices, expectations and martingale differences. Let X be either real line R or 
its subinterval (finite or infinite) A lattice C is a collection of non-trivial finite (bounded) 
intervals of X (say for definiteness of form [a, b)) with the following properties. 

(i) C is a union of generations Ck, k € Z, where each generation is a collection of disjoint 

intervals, covering X. 
(ii) for each k 6 Z, the covering Ck+i is a finite refinement of the covering Ck, i.e. each 
interval / € Ck is a finite union of disjoint intervals J £ Ck-\-l- We allow the situation 
where there is only one such interval J (i.e. J = I); this means that I £ Ck also 
belongs to the generation Ck+l- 

Example. The main example we have in mind is the following one. Consider the space M d 
with a Radon measure fj, and the standard dyadic lattice. Let us represent cubes Qk = Q\ = 
[0, 2 ) by the intervals l\ = [0, fJ,(Qk)) C R. For each cube Q\ we pick some ordering of 
its children (dyadic subcubes of Q\ with side 2 fc ~ 1 ) with Q\_ 1 = [0, 2 k ~ 1 ) d being the first, 
and split l\ into disjoint union of intervals I 1 k _ 1 , of form [a, b), \I 3 k _i\ = MQfc-i)> with the 
ordering of the intervals ijLi given by the ordering of Q^-r 

We then can order children of CK, j / 1 and represent them as subintervals of P k , then 
their children, and so on. 

So we have represented the standard dyadic lattice in the first "octant" [0, oo) d of M d with 
the measure /i by our lattice (with X = [0, n([0, oo) d )), so the measure of each dyadic cube 
equals the length of the corresponding interval. Note, then the dyadic cubes Q, fi(Q) = are 
ignored, the corresponding intervals are empty sets. 

If the measure /i is finite, we can represent the dyadic lattice in all R d as our lattice: in 
general, we can only put 2 "octants" on the line, but the dyadic lattice on the whole space 
can be represented as a finite disjoint union of our lattices. 

1.1.1. More definitions. For an interval / € C, let rk(I) be the rank of the interval /, i.e. the 
largest number k such that Ck 3 I- 

For an interval I £ C, rk(J) = k a child of / is an interval J € Ck+i such that J C I (note 
that by the definition of rk(J) we cannot have J = I, so we can write J £ I). The collection 
of all children of / is denoted by child(J). 
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Let r G Z. We will call a lattice £ a r-adic lattice, if every generation C k consists of 
intervals of equal length, and the generation C k +\ is obtained from Ck by dividing every 
interval I £ Ck into r equal subintervals. 

When r = 2 we have a dyadic lattice in R; if r = 2 d , the lattice represents a dyadic lattice 



in ul d . 



We say, that a lattice £ is homogeneous if 

(i) Each interval I G £& is a union of at most r (r < oo) intervals J € £&+i 
(ii) There exists a constant If < oo such that |/|/| J| < K for every I G C k and every 
J G £fc+i, J C I. 

We say that the lattice £ is proper, if any interval I € £fc+i is a proper subinterval of an 
interval J G C. in this case any interval I G £ belongs to a unique generation. 

1.1.2. Conditional expectations and martingale differences. For an interval I G C let E 7 be 
the averaging operator, 



Ejf := ({II- 1 Jf(x)dx) 1 7 =: (/) / l / 



and let E^ be the "conditional expectation" , 

E fc / = ^ E 7 /. 
/e£ fc 

Consider martingale differences A , A& 

A / = ( E E j)- E /' A fc = E fe -E fc _ 1= Yl A i 

Jgchild(J) Je£:rk(i")=fc-1 

(note that we cannot write A/% = Yli^c A r nere )- 

Let 2lfc be the cr-algebra generated by Ck (i.e. countable unions of intervals in Ck)- Let 
2loo be the smallest cr-algebra containing all 2L., k G Z, and let 2l-oo be the largest cr-algebra 
containing in all 21&, 2l_oo = r\ e z2L.. 

The structure of a-algebras 2loo and 2l_oo is easy to understand. Thus, 2l_oo is the cr- 
algebra generated by all the intervals I of form 

I = [J Ik, where 4 G C k , I k C 4-1- 

fcez 

Note that X is a disjoint union of such intervals I and at most countably many points (we 
might need to add left endpoints to the intervals /, if they happen to be open intervals). It is 
possible that there is only one such 1,1 = %, so the cr-algebra 2l_oo is trivial. Let us denote 
the collection of such intervals / by Sl -^. Define 

(1.1) 2l°t n := {I G 21°^ : \I\ < oo}; 

"fin" here is to remind that the set consists of intervals of finite measure. 

For example, in the case of the standard dyadic lattice in M., we have that Sl ^ = 
{[0, oo), (-oo, 0)} and so 21°^ = 0. 

Instead of describing 2loo, let us describe the corresponding measurable functions. Namely, 
a function / is Sloo-measurable, if it is Borel measurable and it is constant on intervals I 



P| I k , where I k G C k , I k C I k -i- 



k(-_l 
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Clearly, such intervals I do not intersect, so there can only be countably many of them. Note, 
that if we assume that for every x € X 

(1.2) lim |J fc (x)|=0, 

k— >+oo 

where Ik(x) is the unique interval in Ck containing x, then 2loo is the Borel c-algebra. 

1.2. Martingale difference decomposition of LP spaces. In this paper we always assume 
that all functions are Sloo-measurable. 
One can easily see that 



Y, A /= E A fc = E n -E m . 



IGC m<k<n 

m<rk(/)<rz 

Note that for any / G L p (we assumed here that all the functions are Sl-oo-measurable) 

^nf — > f as n — > +oo 

where the convergence is a.e. (for p € [1, oo]), and in the L p norm for p £ [1, oo). 

To compute the limit E m / as m — > — oo, we notice that for a bounded compactly supported 
/ we can estimate \E f\ < C/\I\, so if \I n \ — > oo as n — > oo, then for such functions and for 
pe (l,oo] 

lim ||E 7 /|| p = 0. 

Since bounded compactly supported functions are dense in L p , p € [1, oo), and operators E n 
are contractions in L p , we get applying e/3 Theorem, that for / E L p , p < oo 



E_ n /^ Y E if as n 



oo, 



/ear 



0,fin 

— oo 



where the convergence is in LP for p £ (1, oo) and in a weaker sense (say L 1 convergence on 
compacts) for p = 1 . 

Therefore any function / S L p , p € (1, oo) can be represented as LP convergent series 

(i-3) / = E A //+ E v = j> fc /+ E v 

= ^A fc / + A_ 00 /; 
fcez 
we use the notation A_oo = E_oo := ^ /e2l o,Hn E^ here. 

We had shown the convergence of partial sums £^, but in fact the convergence of the 
series is unconditional (independent of ordering). 

1.3. Martingale Hardy spaces. Everything in this subsection is well known, we present it 
only for the convenience of the reader. 

Let us recall the classical result by D. Burkholder, which in our notation can be stated as 
follows. 

Theorem 1.1 (D. Burkholder). Let f, g be two locally integrable functions on X such that 
a.e. on X 

|A 7 /| < |A 7 #| V/ G C, and \E T f\ < |E /5 | MI £ 21°^, |/| < oo. 

Then 

II/II p <(p*-i)NIp, 
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where p* = max{p,p'}, 1/p + 1/p' = 1. 

In [2] this theorem was proved for arbitrary discrete time martingales, which immediately 
gives the above theorem in the special case |X| = 1, 21^ = {X} for k < 0. The general 
statement can be easily obtained from this special case by easy and standard reasoning, 
which we skip. 

Burkholder's theorem implies that for \a k \ = 1 



£ll/lp< 



fcg»j{-oo} 



<C||/I 



V 



where C = p* — 1. 

Taking for a& independent Bernoulli random variables, taking values ±1 with probability 
1/2, and taking expectation one gets 

P dxdP(oj) < CP\\f\\ p , 
fcezu{-oo} 



T&^-hL ^ a ^)^ k f{x) 



Changing order of integration, and noticing that by Khinchine inequality for any sequence of 
x k £ C, the averages 

\ 1/p 



^2a k (u)x k 



dP(uj) 



and 



/ ^u k (u)x k 



2 \V2 

dP(w) 



£ 



£fc| 



1/2 



\ 1/2 

E i A */o»oi a ) > 

7i if — l / 



are equivalent with constants depending only on p, we can see that the quantity ||5/|| p , where 
Sf is the so-called extended square function 

1/2 

(1.4) Sf(x) = f 

^fc€ZU{-oo} 

defines an equivalent norm in IP (recall that we assume that all functions are 2loo measurable) . 

In particular, this implies that for f £ L p the sum in fjl .31) converges unconditionally 
(independently of ordering) in LP ' . Note, that if for a formal sum / of form (II. 3D we have 
Sf £ L p , then the series converges unconditionally in L p , so L p , p € (l,oo) is isomorphic to 
the set of formal series (jl.3p with 5"/ € L p 

Let us also introduce the classical square function S, where we do not add the term 

|A-oo/| 2 , 

(1-5) Sf(x)=(^\^kf(x)A . 



The situation for p = 1 is more interesting. Recall the classical result of Burges Davis 
[4| comparing maximal function with the square function. Let us recall that the maximal 
function M = Mg is defined by 



Mf(x):= sup |E /|=sup|E fc /(x)| 

l£C:x£l fceZ 
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Theorem 1.2 (B. Davis, 1970). Let M = Mc be the maximal function defined above, and 
let S(f) be the square function defined by (jl.4p . Then 

^WMfW^WSfh^CWMfh, 

where C is an absolute constant. 

Remark. The theorem in [1] was proved for general discrete time martingales, and in our case 
it can be directly applied in to the situation |X| = 1, 21& = {X} for k < 0, E„/ = 0. However, 
the general case can be easily obtained from here by a standard reasoning, which we skip 
here. 

Note, that by the Lebesgue differentiation theorem ||/||i < ||M/||i. Therefore, if ||5/ n ||i — > 
0, then || / n ||i — > 0, so if Sf G L 1 , then the martingale difference decomposition (jl.3p converges 
unconditionally in L 1 . 

Definition. The martingale extended Hardy space H 1 is set of all functions / G L 1 such 
that Sf G L 1 (equivalently, M f G L 1 ), equipped with the norm ||/|| _ = ||5/||i. 

H 

The Hardy space H 1 consists of all the functions in H 1 such that E,/ = for all I G 21 jj^ 
(with the norm given by US'/Hi)- Note, that ||M/||i also gives an equivalent norm on H 1 . 

Remark. For p G (1, oo) the extended martingale Hardy space H p is also defined as the space 
of all locally integrable functions / such that Sf G L p , with the norm ||/||^ p = ||5/|| p . 
While, as we discussed above, for p G (1, oo) the space H p is isomorphic to LP, we will use 
the notation H p as well (for example, to emphasize that we are using a different norm). 

Finally, the spaces H p are defined as subspaces of H p consisting of functions / such that 
EJ = for all / G 21°^. 

1.4. Martingale transforms and martingale multipliers. Let D T := A,L 2 . A martin- 
gale transform is a linear transformation T 

where T, : Dj — >• Dj. We also assume that T¥, T f = for all I G SlJ^. 

Such operators are well defined for finite sums: for now we will not assume the boundedness 
of T. 

If all operators TV are multiples of identity, the corresponding martingale transform is 
called a martingale multiplier. 

1.5. Paraproducts. For a function b let us consider the multiplication operator Mb, M^f = 
bf. We do not assume here that M& is bounded in L? (i.e. that b G L°°. For our purposes, it is 
enough to assume that b G L\ oc , so (Mbf,g) is well defined for / and g with finite martingale 
decompositions, i.e. for finite sums 

(i.6) / = Ev+ E V' 9 = Y, A i9+ E E i9- 
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1.5.1. The "infinite measure" case. Let us first consider the situation when StJ^ = 0. 

In this case, as it was discussed above, the space L 2 is decomposed in the orthogonal sum 
of subspaces Dj, I G C. 

Consider the decomposition of the operator M b in this orthogonal basis 

M b f = Y,Y, A i MbA jf- 

ieCJeC 

This sum can be split into 3 parts: over I Q J, J Q I and I = J respectively. 

The first sum is called the paraproduct and is denoted as 7Tb/; the corresponding operator 
ir b is also called the paraproduct. Since for I C J 

A I (bA J f) = (A I b)(A J f), 

we can write 

(i.7) 7r b f= Yl A /( feA j/)= E (V)( a j/) = E(V)( E //); 

/,Je£:/gJ I,J€£:ICJ I^C 

the last equality follows from the fact that for fixed I £ C, 



E ( A jf)h=^f- 



The second sum (over J Q I) is iv b f, where 7rJ is the dual of 7Tb with respect to the standard 
linear duality (/, g) = J fg. This can be easily seen from the fact that (K f f,g) = (/, E.g) 
and so (A T f,g) = (f,A T g). 

The third sum (over I = J) is the "diagonal" term denoted as A&/. It is easy to see that 

A b f = ^ / A I (bA I f) 
iec 

This diagonal term commutes with all martingale multipliers, so it can be ignored when one 
studies commutators of M b with martingale multipliers. 

In the situation when all intervals I £ £ have at most 2 children, any martingale transform 
is a multiplier, so in this case it is enough to consider decomposition of M b as 

(1.8) M b = TT b + n* b + A b ; 

where we can ignore the term A^ when studying commutators with martingale transforms. 

In a general situation, we can only ignore a term that is a martingale multiplier, so a 
different decomposition is needed. To present this decomposition we need the following 
lemma, which gives us a formula for tt^. 

Lemma 1.3. The (formal) dual n^ of ir b with respect to the standard linear duality is given 
by 



, ,, -T- 



/ = X;E / ((A J 6)(A J /)) 



iec 



= J2^ T ((b ~ E 7 6)(A 7 /)) = £E 7 (6(A 7 /)) . 

iec iec 

The word "formal" here means that the equality {TT b f,g) = (f,^ b g) holds for all finite sums 
f = Eiec A if> 9 = E/ £ £ A /5 . 
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Proof. It is easy to see that (Erf, g) = (f,E T g), and therefore so {Ajf,g) = (f,A T g). Using 
these identities and the fact that (A,fc)(E,/) = A T (bE r f), we get 



(TT b f,g) = Y,( A i( bE if)>9) 

= E( bE if> A i9) 

lee 
= £<V,6A /5 > 



= ^</,E 7 (6A /5 )> 

To complete the proof it remains to show that 

E I (bA I g)=E I {(b-E I b)(A I g))=E I ((A J 6)(A j5 )) , 
which we leave as an exercise for the reader. □ 

To give an alternative (to (jl.8p ) decomposition of Mf, let us notice that 
A I (bAJ) = A I ((b - E J 6)A J /) + (E J 6)A J / 
= A I {(A I b)A I f) + (E I b)A I f. 
Therefore, we can decompose A^ = A^ + A^, where 
(1-9) A fe 1 / = ^A / [(A / 6)(A / /)], 

(1-10) A°/ = ^(E / 6)(A / /). 

iec 

Note, that A® is a martingale multiplier, so it commutes with all martingale transforms. 
Defining 

(1.11) vrW^ + A, 1 , 

we can decompose the multiplication operator Mb, Mbf := bf as 

(1.12) M 6 = 7r 6 + 7r^+A0, 
Lemma 1.4. 

(1-13) 4* ) f = ^2(A I b)(A I f). 

iec 

Proof. Notice that 

(A J 6)(A J /) = E 7 ((A 7 6)(A X /)) + A, ((A 7 6)(A 7 /)) . 

Taking the sum over all I € C we get in the right side n^f + A^f, which proves the lemma. □ 

There is an alternative, probably a more natural way to get the decomposition fll.12}) . 
Namely, let us consider the product bf, which can be written as 

I,J£C 

(let us not worry about convergence here and assume that the sums in the martingale differ- 
ence decompositions of / and b are finite). 
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Let us split the above sum into 3 parts, over the sets I ^ J, J ^ I and I = J respectively. 
The first sum gives us the paraproduct irbf 

(1-14) Yl (A / 6)(A J /) = ^(A / 6)(E / /)=:vr fe /, 

Cf. (HZD- 

The second sum (over J ^ I) can be written as irjb, so using (|1.7p with / and b interchanged 
and recalling the definition of A°, see (jl.lOp . we get 

Y (V)(A j /) = ^(6 j )(A j /)=:A°/. 

Finally, the last sum gives us 

^(A 7 6)(A 7 /) =: vrW/, 
lee 
see (fTT3D . 

Remark. Note, that if £ is the standard dyadic lattice, then E ((A i)(A /)) = (A 6)(A /)), 
so TT b * = 7r£. This fact was used, for example, in [I]. 

1.5.2. Paraproducts in general case. Let us now consider the general case, when SlJ^ ^ 0. 
Consider the decompositions 

/ = E A //+ E v. s = E A /2+ E v. 

/e£ /-pa '" 11 Je£ 7e2r°' fln 

and let us decompose (bf,g). Note that for a fixed / G C 

b ( E A jf + E IE / /),A /5 ) = ((A / 6)E / /,A /5 } = (n/,A 7 < 7 }, 

where, as above 

(1.15) 7r 6 /:=^(E J /)(A / 6). 

ieC 

Similarly, 

bAjfA Y A J+ E ^ I g)) = (A I f,(A I b)E I g) = {A I f,7T b g). 

\JG£:J3/ jG2l (),fln :jD/ / 

As we discussed above 

Y(b& I f,A I g) = (A b f,g), 

i&c 
where 

(1.16) Atf := AfiAjf). 

The only terms in (/, g) that we did not count yet, are the terms with /, J E 2lJo^, which 
give us the remainder 

Y (bE I f,E I g) = ( Y (E / 6)E / /,E /5 )=:(i? 6 / )3 ). 



/e2t 0,fin /e2l 0,fln 
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So, the multiplication operator M b can be decomposed as 

M b = TTft + A 6 + 7T 6 + R b , 
where the paraproduct n b is defined by (|1.15|) . 7r£ is its adjoint, A b is defined by (|1.16p . and 



(1.17) 



« b / = (E_ 00 6)(E_ 00 /)= Y, ( E i b )( E if)- 



im 



O.fln 



Note, that Lemma 11.31 remains true in the general case as well: the proof is exactly the 
same. Also, nothing changes in the decomposition A b = A b + A^, because we can investigate 
this decomposition separately in each block D T , and these blocks know nothing about ^J^. 
Finally, the proof of Lemma 11.41 works in the general case without any changes. 

Summarizing we can state the following proposition. 

Proposition 1.5. The multiplication operator M b is represented (at least formally) as 

M b -R b = 7rJ* } + A° b +ir b = ir* b +A b + ir b = vr b * + A° b + A^ + ir b 
where 

A b f = J2& I (bA I f), A 6 = Ag + A£, 

Ag/ = J^(E J 6)(A J /), 

iec 

A 1 b f = ^A I [(A I b)(A I f)]. 
iec 

2. TRIEBEL-LlZORKIN TYPE SPACES. 

This part is devoted to the investigation of the "coefficient space" of the spaces H p . We 
are mostly interested in the spaces with q = 2, but since since the result for q ^ 2 are often 
obtained with little or no extra effort, we consider the case of general q here. 

The notation gp is chosen by the analogy with the notation ip' q for Triebel-Lizorkin 
spaces, see for example [B]. We use a different scaling here, so to avoid the confusion we use 
the different notation. Also, we do not use smoothness parameter a (we do not need it in 
what follows, and frankly, it is not completely clear what should be the correct smoothness 
in the general non- homogeneous case). For the standard dyadic lattices in M rf our spaces gp 
are isomorphic to { p ' q , with isomorphism given by rescaling of the entries. 

2.1. Triebel-Lizorkin type spaces gj,(£). Let £ be a lattice. For 1 < p, q < oo define the 
sequence spaces gp(£) consisting of sequences s = {sj}i^c such that 



\s Us 



s q P (c) 



X>|%) 

iec y 



1/9 



LP 



For p = oo the norm is defined using BMOdike norm 



8 A9 



&(C) ■= -P Tj 



/e£,/cJ 
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Formally, one can define the whole scale of spaces g^ (-£), 1 < r < oo, 

1/r 



: =-pfm/( E 



|a|l *& w w := 5S?llJF./A. IjJ^ 1 ^^ 



but it will be shown later that the norms are equivalent for 1 < r < oo. 

To shorten the notation, we will omit C and use the notation gp instead of gj,(£), when it 
is clear from the context what the lattice C is. 

The spaces gj>(£) can be naturally identified with the subspaces of L p {£ q ) (L p with values 
in £ q ). Namely, for a sequence s = {s 7 }/ e £ define functions 

fk= J2 Sjij, kez, 

J€£:rk(J)=fc 

and let 

f(x,k) = f k (x), kez, xeR. 

Then clearly, for 1 < p, q < oo 

W s h q p (c) = II/IIlp(^) := f / \\f(x, -)\\%dx\ . 

Thus, the space g$(£), 1 < P, <Z < oo can be naturally identified with the subspace of 
L p (£ q ) consisting of functions / such that f(-,k) is constant on intervals I G C, rk(7) = A;, 
and such that f(x,k) = if there is no interval I € C, rk(I) = k containing x (recall that 
rk(J) is the largest integer k such that / € Ck, so the condition I £ Ck does not mean that 
rk(/) = k). 

We will routinely switch between the function and sequence representation of elements of 
g|>, so / £ gp as a sequence {/r}r p£ or as the corresponding function /(-,•)£ L p (£ q ). 

We will also need the notion of the coordinate projection of / € gj,. Namely, for £ C £ 
define the coordinate projection /„ by 

(2- 1 ) fe = Wiee 

(meaning that entries corresponding to I £ £ are 0). In the function representation this can 
be written as 

(2.2) / £ (-, *)=/(',*)•( E h)> fc€Z - 

/6£:rk(/)=fc 

For / € L p (£ q ) define the vector Hardy-Littlewood maximal function /* 

f*(x,k) = sup — / \f(s,k)\ds. 

I3x l-M J/ 

We will need the following well-known theorem 
Theorem 2.1 (Fefferman-Stein, [5]). lei / E L p (^) ; 1 < p,g < oo. Then 

11/ ||lp(#) < C||/IIlp(£9), 
where C depends only on p and q. 
The following fact is well known. 
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Proposition 2.2. For 1 < p, q < oo the dual space (gp(£))* is isomorphic to ^,{C), where 
\/p + 1/p' = 1, 1/q + 1/q' = 1, and the pairing is the standard one 

(2-3) (f,g)= fY,f^ k Mx,k)dx= fY.fi9i 1 i = Y.fi9i\ I \- 

, ' x k , ' x /e£ lee 

Note that only claim that the norm in g, q ,(£) is equivalent to the norm in the dual space 
(except the trivial case p = q = 2 when the norms coincide). 

For the sake of completeness we present the proof of this proposition. 

Proof. Since (L p (£ q ))* = LP (£ q ) (for 1 < p, q < oo), any g £ gl,(£) define a bounded linear 
functional L on gp(£), and ||L|| < \\g\\ .q' lrV 

On the other hand, if L is a bounded linear functional on gp(£), it can be extended by 
Hahn-Banach Theorem to a bounded linear functional on L p (£ q ), which can be represented 
by a function g £ LP (£ q ), 

L(f) = / E /( x < k ^ x > k)dx, V/ G g|(£) 

k 

Note that functional L will not change if we replace the function g by its "orthogonal" 
projection g onto gp,(£), 

i u\ _ / 1-^1 X fid( s ' k)ds, if rk(7) = k, and x € I 

S(x,fc) - | Q if ,9/9 2, rk(I) =fc. 

Clearly |g| < (g)*, so by the Fefferman-Stein maximal theorem (Theorem 12. 1|) 



□ 



Dual of gj' is given by the following theorem. 



Theorem 2.3. Let 1 < q < oo. Then the spaces goo , 1 < r < oo do noi depend on r, and 
the corresponding norms are equivalent. Moreover, the dual space (gf)* is isomorphic to g^,; 
here again 1/q + 1/V = 1 and the pairing is given by (|2.3p . 

Proof. Let g £ g^, . We want to show that 

|(/,5)l<C||/|| i? || 5 |U (1) . 

It is sufficient to prove this inequality on a dense set of functions / for which the corre- 
sponding sequence {sA r ~ has finitely many non-zero terms. 

Let E k := {x £R: \\f(x, ■ )\\ tq > 2 k }, and let £ k := {/ G C : I C E k }. Note than E k is a 
finite disjoint union of maximal intervals I € £ k , maximal meaning that there is no interval 
in £ k for which / is a proper subinterval. 

One can easily see (see Fig. Q] )that 

Y,2 k \E k \<2 f\\f(x, -)h q dx. 
kez J 

Since £ is a disjoint union of the sets £ k \ £ k +i we write 

* = 1^ *s k \e k+1 



1-1 
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Figure 1. Level sets 



where the functions f c , c are defined in the sequence representation / = \f T } T ~ r by 

Note that 

(2.4) ||4 w (x,.)ll„<2^1, fc (x). 

Indeed, the estimate for x ^ -Efc+i is trivial. For x £ -Efc+i let J be the maximal interval in 
£k+i containing x. Then for this x 

(2-5) Il/„.v,..>,-)ll2.= 



£fc\£tH 



E i//i 



ie£ k -.QJ 



Let J be the parent of J. We assume that J £ £*., because otherwise the sum is trivial and 

fc\c = on I. Then for any y € J 

tfe\tfc+i 



(2.6) 



E l/,l' < ii/ £A&+ > • 



l€£ k --I^J 



(the right side contain all the terms from the left side, plus probably some other terms). 
But since J <£ £k+i, there exists y G J such that y ^ ^fc+i, and so 

Together with (|2.6|) and (|2.5p this inequality implies ([2 



COMMUTATORS, PARAPRODUCTS AND BMO IN NON-HOMOGENEOUS MARTINGALE SETTINGS 15 

To finish the proof let us take g € goo , \\g\\ , < 1 and estimate 

So© 

K/>S)I = \JY<fi9ih dx \ ^ E / E 1/jllSjlM* 
i&c k& J ie£ k \£ k+1 

- 1^ ^£ k \£ k+1 h°°(i«) W 9 £ k \£ k+1 \i (eq ') 

fcez 
<^2 fc+1 |^|<4||/|| L1( , g) . 



Here in the third line we used the fact that by (12.41) we have II f c , c II roo/ ,„, < 2 k+1 and 

J * " " J £ k \£ k+1 "L °°(£«) — 

that llo II < II ol. II . < \E k \. 

So, we have proved that a function g € goo defines a bounded linear functional of g^. 
Let now ip be a bounded linear functional on g^. By Hahn-Banach Theorem it can be 
extended to a functional on L 1 (^ <7 ), so it can be represented as 

</>(/) = (/,?) = / ^2f(x,k)g(x,k)dx 

^ kez 

where oG L°°(£ q '), \\g\\ , =\\<p\\. 

Let g = Vseq(g) be the projection of 5 onto the space of sequences, i.e. let the function g 
is given in sequence representation by 



9 1 = T f(x,rk(I))dx. 



The projection V se q, as it can be easily shown, is not bounded in L ca (£ q ), but by Fefferman- 
Stein theorem (Theorem I2.1|) it is bounded in U(£ q ). 

Therefore, for any (finite union of intervals) Bel and the collection £ := {I € C : I C E} 

11/7 ll r < CWTi \\ r < nWWYn \\ r 

H%Hl''(W) - < - y ll%llL''(W) - ° 1-^111% Hl°°(^') 

which means exactly that g G goo , ||g|| v.w < C||</>||- □ 

2.2. Embedding theorem for g|. Let {a,}, . be a collection of numbers. We are inter- 
ested when the operator 

/^K(/) 7 } /e£ 

is a bounded operator from LP to gp; recall that for a function / the symbol (/}. denotes it 
average, (/) 7 = fc f. 

The answer to this question is well known if p = q, is is given by the famous Carleson 
Embedding Theorem, that says that a necessary and sufficient condition to the boundedness 

is 



1 z ^ 



sup 7-7 > la 7 \ q < 00 

l/r \I\ Z -^ J 



which means exactly that {a, }, - € goo = goo- This result is especially well known for 
p = q = 2; the situation for p = q can be obtained, as it will be shown below, by the standard 
comparison with maximal function. 



II, 
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Below, we will show that the answer is the same for all p G (1, oo). For p = 1 the above 
condition is not sufficient for the embedding, but if we replace L 1 by H , then the result can 
be extended to p = 1. 

Theorem 2.4. Let p G [1, oo), q G (1, oo) and let a = { a A jG r ^ e a collection of numbers. 
Then the operator A a 

A a f = {a I {f) I } IeC 

is a bounded operator H p — > gp if and only if a G g^o • 

Proof. To prove the necessity of the condition a G gJL we just have to test the embedding 
operator on the functions l f , I 6 £. Since (1^), = 1 for J C I", the boundedness of the 
operator A a implies 

v 



E «A 



JeC-.Jd 



<C\\1 



i"p 



c\i\, 



which means a G gc» = gL> (by Theorem 12.31 the space g& does not depend on p) . 
Let us now prove sufficiency. Let 

E k = {xeX:\M c f(x)\>2 k }, 



and 



£ k = {I G C : / C E k }. 



We can write 



A a f = ^(A a f) 



fcez 



£k\£k + l ' 



where (A a f) , denotes the coordinate projection (|2.ip of A a f with £ = £ k \ £ k +i- 

£"fc\£-fc+i 

Assume that ||a|L« < 1, and let a . be the corresponding coordinate projection (12.11) 

of a. Since |Er/| < \M„f\ < 2 k+1 on / G £ k \ £fc+i> we conclude that 



(Aaf) 



£k\£k-{ 



LP {li) 



< 2 {k+l ^\ 



a 



£ k \£ k+1 "LP(ii) 



<2" +1 \E k \. 



Therefore, 

(2.7) E||(^/) 



A-el 



£k\£k + 1 



LP (11) 



<^2 2{k+1)p \ E k\< c \\ M cf\\p^ c ^ 



HP 



which is exactly what we need if p = g. 

The case p < q is also easy. We can write (in the functional representation) 



\ A »f\\ P LP{eg) 



Ell^w) 



£k\£k + 



(*,-) 



^/(E|(^/) 



fcez 



E K^W 



£fc\£fc+l 



(*,-) 



9 \ 9 



f9 



P \ P 



dx 



dx 



kez 



LP (li) 



<c 



HP 



because ||s|L, < ||s|L P for p < q 
by dZZD 



The case p > q is a bit more complicated. To treat this case let us first make few sim- 
plifications. Of course, without loss of generality we can assume that / > and that all 
Uj >0. 
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Lemma 2.5. Let f € L p , 1 < p < oo, / > 0. There exists a function f , f > f , \\f\\ P < 
C p || f\\p such that 



Mf < C' p f, 
where M is the Hardy-Littlewood maximal function. 

The condition Mf < Cf for / > is the so-called Muckenhoupt (^4i) condition. It implies, 
in particular, that for any interval / 

f f < 77 min/(x) 

J I G xdl 

Proof of Lemma \2.5[ Define / 

J := Y^ 7 k M k f, 

k>0 

where M k is feth iteration of M, and 7 > is sufficiently small, so that 

1, 



j\\Mh\\ p < -\\h\\ p Vh£L p . 



□ 



Replacing / by / from Lemma [2 .51 we can assume without loss of generality that Mf < Cf. 
It is an easy exercise with Holder inequality and Resonance Lemma (the fact that equality 
in Holder inequality is attained) to see that if p > q and 1/p + 1/r = 1/q, then 

\\F\\ p = sup{\\Fg\\ q :g€L r ,\\g\\ r < 1}. 

Take g € L r , g > 0, \\g\\ r < 1. Since clearly r > q, applying Lemma 12.51 to g q with the 
exponent s = r/q for p, we get a function g > g such that M(g q ) < Cg q and 

\\-X\\r _ 1 1 ~<? 1 1 s < r'lln'?ll s — r?llnll r 
\\9\\r — \\S lis ^ ^\\9 lis — Mlffllr- 

So, replacing 5 by g we can assume without loss of generality that Mg < Cg and \\g\\ r < C. 
To complete the proof, let us first notice that 

(2-8) \\A a (f9)\\ LHiq) < C\\fg\\ q < C\\f\\ p \\g\\ r < C'\\f\\ p . 

This inequality follows from the case p = q we discussed above. We used here the fact that 
II all , . < II all , , , which follows immediately from Holder inequality; note that we do not 

II II .q,(q) — " " -q,(p) ' J ^ J ' 

600 &OO 

need here the full equivalence of g^, -norms for all p. 



So, in light of (I2.8p . we only need to show that 

\\(A a f)g\\ Lq{fq) <C\\A a (fg)\\ Lq{iq) 
which follows immediately if the estimate 

m^^Cifg)]. 
j 1 

holds uniformly for all I G C 
We know that 

(2.9) min f(x) <{f) T <C min f(x) 

x£l 1 x£l 

and that 

(2.10) (g q ) T < Cmmg q (x) = C(ming(x)) q < C{g) q 

i£/ x&I J 
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and therefore 

r \{f) J g\ q = {f) 9 J {^) I 



<C{f)]{g)] bydHUD 

<Cmmf{x) q {g) q r bv (1231) 

x&I 1 

< C(fg)* 



3. H P a AND BMO a SPACES 



□ 



Most of result of this section are well known, and are presented there only for the conve- 
nience of the reader. However, I believe some proofs are new. 

3.1. Hq spaces. As we had discussed in Section [L3l LP norm of a function f,p£ (1, oo) 
(H 1 norm if p = 1) is equivalent to LP norm of the square function Sf. 

Acting by analogy, one can use u q- function" instead, and consider the space Hq, p,q G 
[l,oo] of formal martingale difference decompositions, such that 

(3-i) ii/ii*>:= |(Ei a //i ? + £ ivr Nl/ffl 

lee /e2i°' fin 



\i 



< oo. 

LP 



We assume here that the "martingale differences" A,/ are simply some functions h r , constant 
on children of I and such that J x h f dx = 0. The functions E T f are just some multiples of 1,. 
While we do not assume that all h f are martingale differences for some function /, we will 
still use notation A , meaning by / the whole collection of such "martingale differences" o 

We can also consider the space Hq, consisting of formal martingale difference decomposi- 
tions for which E, = for all I € SlJ^- 

The spaces T^and Hq are clearly Banach spaces as closed subspaces of L p (£ q ). 

As we discussed above in Section [L3l if / € ^|» then the series converges to a function in 
LP (to a function in H 1 if p = 1), and for p € (1, oo) the norm ||/|| „ is equivalent to the 

standard LP norm. 

Since ||x||^a < ||^||^ for q G [1,2], any formal martingale decomposition / G Hq, q G [1,2] 
converges to a function in LP . Thus we can identify in this case the spaces Hq and Hq with 
function spaces, which we denote Hq and Hq respectively. 

For q > 2 convergence is not clear, so in this case we only consider the spaces Hq and Hq 
of formal martingale differences. 

Remark 3.1. Informally, we can say that / G Hq if {A,/}, . G g|>. We are saying "infor- 
mally" here, because A . are not numbers but functions, so we have to interpret the sequence 
{A 7 /} 7 f, as the sequence of numbers. 

In this paper we will interpret this by saying that each A,/ defines entries x ,, J G child(I), 
where x T is simply the value of A , / on J. 



Such notation is partially justified by the fact, that in the essential case when our collection has only 
finitely many non-zero terms, all A . are martingale differences of the function ^2 Ie£ A f / 
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The space Ti^ is the classical martingale H 1 space, and we often will skip index q dealing 
with the case q = 2. Spaces 7i q we studied by many authors, for example they were the 
spaces H p q considered in [15] (S in the superscript stands for "square function"). 

Remark. There are alternative ways of obtaining entries x T from the martingale differences 

A,. For example, one puts Xj := (E^A^/I 9 ) , the condition {x [ }j e c G gp, define a 
martingale Hardy space what is denoted H p q in |15| (note that the supescript here is s q , 
unlike S q in the previous paragraph. 

For the lattices of homogeneous type it is not hard to show that this definition is equivalent 
to the first one. It is also well known and will be seen from what follows, that in the general, 
non-homogeneous case, the spaces can be different. 

3.2. BMO spaces and i/ 1 -BMO duality. We want to define BMO spaces, so we have the 
i^-BMO duality, as usual. 

Definition. We say that a formal martingale difference decomposition / = ^j^A,/ be- 
longs to the space BM.O q , q G (l,oo) if the sequence {xj} Ie ri obtained from {A 7 /} 7 . as 
in Remark 13. 11 belongs to goo- 

The extended BMO spaces BM.O~ are obtained by adding to the formal / G BM.O q 
additional terms __ 

£ V> ll^/Hoo < C < oo V/€2l°_4 n . 

/e21 0,fln 



We can rewrite the definition of BMO q by picking r G [l,oo) (recall, than defining space 

goo we first defined the spaces goo and and then had shown that all g^ -norms are equiv- 
alent) and saying that / G BM.O q if for any / G C 

(3.2) U £ \&jf\ q y r dx<C<oo 

(uniformly in I), and, in addition 

(3.3) sup||A /||oo < oo. 

lee 

Remark. One would expect, that the condition (|3.2p alone defines the space BA40 q , but it 
was known for a long time, that the additional condition is needed. One can look, for example 
at the 1973 Garsia's book [7j where the BM.O2 space was defined. One can easily see that 
the definition from [7j is equivalent to the one presented here. 

If |X| < co, BM.O q C HI, so (see Section [3~7il above) for q G (1,2] the formal martingale 
difference decomposition / G BMO q converges to a function in U ' . So in the case |X| < co 
one can identify for q G [1, 2] the spaces BMO q and BMO~ with function spaces, which we 
will call BMOq and BMO q respectively. 

The following theorem is known, but the proof presented here is probably new. 

Theorem 3.2. The dual of the space T-L q (resp. H q ), 1 < q < 00 is the space BM.O q > 
(resp. BMOq,). 

Proof. We will prove the duality between 7i q and BA40 q ,, the duality between %]. and 
BMOZ follows trivially. 

The sufficiency of the condition g G BAiO q i for the boundedness of the linear functional 
/ ^ {f> 9} on T~t\ follows immediately from Theorem 12.31 



20 SERGEI TREIL 

To prove the necessity of this condition, let us note, that by the definition T-L\ can be 
identified with a subspace of g[ (by identifying the family {A f g} r . with an element in g* 
as described in Remark 13. 1|) . 

Thus a linear functional tp on Tih can be extended by Hahn-Banach theorem to a functional 

on g^, so by Theorem 12.31 there exists g = {9j} Ie r £ S°o, \\g\\ , < C\\p\\ such that the 
functional p is given by 

(3-4) tp{f) = [ E E \f(^9j dx = E / E \f( x )9jdx. 

Jx lGC J6child(7) l€£ X Jgchild(7) 

We would like to interpret the function X)jechiid(J) ^j^/ as a mar tingale difference, but this 
function does not have zero average. But since j^Ajfdx = 0, the integrals in the right side 
of (|3.4p do not change if we subtract from g ,, J € child(I) a constant c = c r 

Therefore, if for J S child(I) we define g, := 7jj — |/| _1 J2i'echild(l) 9p l^'I' we ^ e * ^ a t 



^(/) = E / E \f(*)9jdz. 



/G£ " A Jechild(I) 

But now the functions Sjechiidm fljlj have zero average, so we can treat them as martingale 
differences. 

■ a' 

Let us check that {9j} Ie r £ goo- Using the fact that the averaging operator / i-> (/) 7 1 7 
is a contraction in all L p , p £ [1, oo] (it follows immediately from Holder inequality), we can 
see that 

ii E SjIjW <2\\ E 9jK 

JGchild(7) Jechild(J) 

Using this inequality we get that for Iq E C 



E 

JeC:Jci 


W 


I'- 7 ! 


— 


K 


q 








< 


K 


q 








< 


K 


q 



i4i+ E II E »a 

iec-.ido Jechiid(J) 



q' 



«i + 2 '' E 1 E l,h 


q 


IeO.IClo Jechild(J) 


o| + 2 q '\I \ ■ II oil 9 ',, ,>. 

U| | U| IliMI 9 ',( 9 ') 
&00 





Noticing that \g T | < \\g\\ . 9 ',( 9 '), and therefore \g r \ < 2\\g\\ . 9 ',( 9 '), and taking into account 

') goo Soo 

that g^c norms are equivalent for all r € [1, oo), we conclude that ||{<7 A } rp ,. \\. q ' < C||<?|| .«' ■ 
Thus g = { 9l } IeC eBMO q/ . D 

3.3. BM.O q as function spaces. 

Proposition 3.3. For q € [1,2] i/ie space BAAO q can be identified with a function space, 
i.e. for each formal martingale decomposition f = ]C /g £ A,/ ^ BA40 q there exists a locally 
integrable function f such that for all I £ C 

A 7 / = Aj. 

A similar statement holds for spaces BA40~ as well. 
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Remark. As it can be seen from a simple example below, the martingale difference decompo- 
sition / = Yliec ^// ^ BMO g , q G (1, 2] does not necessarily converge if |X| = oo. 

Nemely, let £ be the standard dyadic lattice V in R, and let Ik = [0, 2 k ), k G N. Consider 
the formal martingale sum / = Y2fc=i ^r /j w here 

A /fc / = l[ 0j 2*-l) - l[2 fc -!,2 fc )- 

It is easy to see that / G BM.O q for all q G (1, oo), but the series clearly diverges. 

Proof of Proposition \3.3[ It is sufficient to analyze the convergence on each interval J G 21^00 
separately. 

If \J\ < oo, then the series /, := YlieC-icJ ^i-f belongs to H 2 ,. Therefore, as it was 
discussed before in Section 13.11 the series converges to an I? function. 

Let now consider J G 21?. oo such that \J\ = oo. It is not hard to see that any such interval 
can be represented as the union 

J = (J h, he child(/ fe+ i) Vk > l 
fc>i 

(note, that here k is not the number of generation). 
Let 

C(J) := {I G £ : I C J, / + I k Vk G N} 

so the collection {/ G C : I C J} is split into a disjoint union of £ and the set {Ik, : k G N}. 
For k = 2, 3, . . . let «& be the value of A, on Ik-i, aid let ao = 0. Define the function / 
on J by 

/~ : = E v+EK/-*- 1 . 

/e£(J) sgN 

Let us show that the sum restricted to any of the above intervals Ik converges in L 2 (Ik). This 
will immediately imply that A./ = A f f for all / G £, / C J. 

The second sum trivially converges, because A r / — a s l , = on Ik if s > k. 

Is J 

Let us show the convergence of the first sum. Note, that we only need to count the terms 
Ajf with / G £(J), I G Ik, because the terms with / G £(J), I <£ Ik are zero on /&. 
Condition / G ^.MOq implies that 

J2 A 7 / e Wj c «§, 

iec(j):ici h 
so the sum converges in L 2 (Ik). □ 

4. L p BOUNDS OF PARAPRODUCTS 

4.1. Martingale differences do not form a strong unconditional basis in H p in the 
non-homogeneous case. In |9] the notion of a strong unconditional basis was introduced. 
A system of nontrivial subspaces £j (of a Banach space X), j G J (where J is a some 
countable set) was called a strong unconditional basis if 

(i) The linear span £{¥,j : j G J7} is dense in X; 

(ii) There exists an ideal Banach space Y of sequences {cj} . „ and a constant A > 
such that for any sequence {^j} p7 , Xj G £j with finitely many non-zero elements 

{INI} ie<7 



All^- X '" <: 



A' 



jej A r jej 






22 



SERGEI TREIL 



Recall that a Banach space Y of sequences {cj} . „ of complex numbers is called ideal if 
for any sequence of factors ctj, |ay| < 1 the sequence {otjCj}.^^ € Y and |||a 



i J jeJ 



i c j}j 



£J"Y 



< 



II -Tr4 i 

Note, that a strong unconditional basis is an unconditional basis, meaning that any vector 
x £ X admits a unique representation 



x 



7 , x i' 



Xj € Cj, 



A, form an unconditional 



and the series converges unconditionally, i.e. independently of the ordering of J . 

One can easily see that the martingale difference spaces D T 
basis in H p , p G [1, oo). It is also well known that for a homogeneous lattice C the subspaces 
D form a strong unconditional basis. 

Unfortunately, as we demonstrate below, that is not the case in the general situation. 

If the system of the martingale difference spaces D f were a strong unconditional basis, one 
could guess that the natural "coefficient space" for H p should be the Triebel-Lizorkin type 
space gp. 

In other words, one could guess that one could get an equivalent norm in H p by replacing 
the functions A,/ in the square function by multiples of 1,. The norms have to be equivalent 
on singletons / = A /, so if one wants replace functions A / by c,l,, c, = c.(/), the 
functions A f f and c,l, should have equivalent LP norms (uniformly in I). 

If everything works when the norms of A / and c,l, are equivalent, it works when they 
are equal. So everything reduces to the question on whether the quantity 

(4.i) |(E( E /i A //i p ) 2/p ) 1/2 

lee 
gives an equivalent norm on H p . 

The answer is well known to be "yes" in the case when the lattice if of homogeneous type. 
In fact, in this case for q € [1, oo) the averages (EJA J 9 ) 1 ' 9 are equivalent, so one can replace 
A / by any of these averages (the case q = 2 is usually considered in the literature). 

In the general case, as the theorem below asserts, only "half" of necessary inequalities is 
holds, so the answer is unfortunately "no". 

Note, that Theorem 14.11 does not imply that the system of martingale difference spaces 
Dj is not a strong unconditional basis: it only implies that a particular norm on coefficient 
space does not give an equivalent norm. However, modifying the proof of Theorem 14.11 one 
can show that indeed the martingale difference spaces D f do not form a strong unconditional 
basis in H p . 

Theorem 4.1. Let f e H p . 

(i) For p G [1,2] the inequality 



(4.2) 



(X>zi A //r) 



2/p 



iec 



1/2 



H(Eivi : 

iec 



1/2 



(4.3) 



holds; here C = C p and does not depend on f and C 
(ii) For p £ [2, oo) the opposite inequality 

1/2 



iec 
holds with C = C„. 



Ei A //i 2 <q E( E /i A //i p ) 



\2/P 



n\ 1/2 



iec 
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(iii) For p G (2, oo) the inequality (|4,2p fails, i.e. for each p > 2 one can find a lattice C 
and f G H p such that the left side of (J4.2J) is infinite (while the right side is finite 
because f G H p ). 

(iv) For p G [1,2) the inequality f|4.3|) fails (in the same sense an in the statement (iii)j 



4.1.1. Proof of two estimates in Theorem \4-l\ To prove statement (i), let us consider the 
sequence {\A I f\ p } IC E g^, q = 2/p, where as in Remark 13.11 \Ajf\ p defines entries Xj, 
J G child(J), x T being the value of |A f\ p on J. Then the estimate (|4.2p follows from 
immediately from the boundedness of the averaging operator Av in g^, q = 2/p 

(Avx^i/r 1 y, x j\ j \- 

Jgchild(7) 

To prove that Av is bounded, let us notice that its adjoint Av* is the "forward shift" 

(Av* x)j = x~, I is a parent of I. 

We want to show that this operator is bounded in g^> = (gf)*. If x = {xj} It - c G g?o, then 
for J G C 

J2 \{A,*x) I \*l I = \x/l J + £ |*/l, 
iec-.icJ IGCJCJ 

where J is the "parent" of J. Since Ice J < ||x||.„/ and 

A I T I II II „** 

J &oo 



J -J T r- r . T/— T 



|cc|| , 

1-.L':1 ..I ^ 



we conclude that 



£( Y \(Av*x)/l I )dx<2\ 



, l X ll. 9 ') 

,J ieC-.icJ go ° 



which proves that Av* is bounded. Therefore Av is a bounded operator in g^, which proves 

Statement (ii) follows from (J4.2J) by duality. Namely, take g G H p , 1/p + 1/p' = 1, 
1 1 9 1 1 hp' — 1 an d estimate 



[ fgdx\<J2 [\ A if A i9\dx 

<Yf^ I \ A I f\ P ) 1/P ^ I \ A I 9\ P ') 1/P 'dx 
iec jl 

= fY^i\ A I f\ P f IP ^i\ A I 9\ P ') W d X 

J iec igc 

iec p iec 

By (02]) 

"^(^lA^r') 2 ^!,) 172 ,<C|H| Hr 



/e£ 



24 SERGEI TREIL 

So by taking supremum over g G H p , ||p||# P ' < 1 and taking into account that the dual of 
H p is isomorphic to H p , we get 

\\f\\H P <c\\(£^ I \^f\ p ) 2/p " 1/2 

which is exactly condition (ii). □ 

4.1.2. Counterexamples in Theorem \4-l\ To prove (hi), take Iq = [0,2). Fix n £ N, n > 2 
and let 

l k = [0,r k ), J k = [ r \r k - 1 ), 

where r = 1 — 1/n, k = 1, 2, ... n. Note that ifc-i is a disjoint union of ifc and Jk- 

We assume here that Ik,Jk £ £fc i we will only consider functions whose only non-zero 

martingale differences are A r /, k = 1,2, . . . ,n, so the other intervals in Ck are irrelevant 

for our construction. 

For k = 1,2, ... , n dehne 

A, / = 1, -al 7 

where a = l/(n — 1), so J x A, /dx = 0. 
We can estimate that on Iq 

k=0 fc=l v ; 

(each point x € Iq belongs to at most one of the intervals J^ , which contributes 1 to the sum, 
and each 1^ contributes a 2 ). Therefore 

n— 1 -, /n 

,1/2 



1 



1 /2 

(4-4) ||(El A 4/| 2 ) 

fc=0 


< 2 


On the other hand for x € Ik-i 


E h J\ 1 /I P = - + ( 1 - 1 A 


so for x £ I n 





" „ , / 1 \ 2 /p 



EKJ A ik.J\ p r p >-(-J » 



Since p > 2 we have n 1_2 ' p — >• oo as n — > oo, so by increasing n we can make the left side of 
(|4.2p as large as we want (because \I n \ = (1 — l/n) n > l/2e for sufficiently large n). But by 
(|4.4p the right side of (j4.2|) is uniformly bounded. Thus, the uniform (in all lattices) estimate 
(IQ) fails. 

Repeating the construction (with n — > oo) on disjoint intervals, we get a lattice where the 
uniform (in /) estimate (J4.2J) fails. But from here one can easily construct a function such 
that the right side of (|4.2p is finite, but the left side is infinite. 

The same construction allows us to prove statement (iv) as well. Namely, we can easily 
see that on J = U£ =1 Jk 

n—l -, ,„ 

fc=0 
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Note that \J\ = \ — r n = 1 — (1/n)", so for sufficiently large n, we can estimate that \J\ > f/2. 
Therefore 

n ~ l x 1/2 

IV /IT >2~ 1/p . 

On the other hand for x € Ifc-i 



71—1 

(E 



/' 



so for x £ It 



Therefore 



/ fc _i' Jfc-i'i n n ^ n / \n-l/ n 

EKJ \,/r) 2/p ^ (~f P = * 2/p - l - 2/p - 



fc=i 

n.-l 



fc=0 

because p < 2. 

So, for p G [1, 2) the uniform estimate (j4.3|) fails, and from here is is easy to get a function 
for which the right side is finite, but the left side is infinite. □ 

4.1.3. Not a strong unconditional basis. 

Proposition 4.2. There exist a lattice C such that the martingale difference spaces D f do 
not form a strong unconditional basis. 

This proposition also demonstrates, that unlike the case p = 2 the uniform boundedness 
in LP of the blocks T, of a martingale transform T does not imply the boundedness of T in 
D>,p^2. 

The proof of the Proposition can be obtained by modifying the construction in Section 
14.1.21 Define Iq = [0, 1). Fix n € N, n > 2. Let split Iq into two subintervals, I\ and J±, 
where \J\\ = (1/n) | Jo I ; so \h\ = (1 — l/ n )l^o|; and let us split intervals I\ J\ into two equal 
subintervals, let us call them, I±, </{% k = 1, 2. These four intervals will be children of Iq. 

For an interval /, let h T be the Haar function (normalized in L°°), 

where /+ and I_ are the right and left halves of I respectively. 
Define the martingale differences in D T , 

where as in Section 14.1.21 a = (n — l)" 1 , and /3 = n~ p (l + (n — l) 1_p ) , so 
(4-5) ||A /o /||p = ||A Jo5 || p . 

We then apply the same construction to the "children" l\ of Io, then to all "children" J^ 
of all Ii and so on. Note, that we do not care about the "children" of the "smaller" intervals 
J*, because we put the martingale differences to be zero for all intervals different from one 

So, we get the collection of intervals Ii and the corresponding martingale differences A - /, 

h 
A g, constructed the same way as in (|4.5p . 
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But now let us notice that this construction just models the construction from Section 
14.1.21 Namely, for every k € N, the total length of the intervals I 3 , is exactly the length of the 
interval L k from Section [4.1.21 It is easy to see that the function ^ • |A . /| and the function 

k 

| A, /| from Section [4.1.21 have the same distribution function. Moreover, the corresponding 
square functions 

ElA,/! 2 ) 1 ' 2 a„ d (ElA,/l^ 1/2 

k,j k k 

also have the same distribution function. 

The distribution functions of the square function 

(£'V' 2 " /2 

and of the function 



(E 



\ 2 /P\ 1/2 



k 

from Section 14.1.21 also coincide. Therefore, all estimate from Section 14.1.21 apply here, and 
repeating the reasoning from this section we prove the proposition. □ 

4.2. "Paraproduct" version of embedding theorem. Let b = {b 1 }, ^ be a family of 

functions such that b is supported on / and is constant on "children" of /. Define a "para- 
product type" operator n = tt^ by 

IeC:vk(I)=k 

If b 1 = A b for some scalar function b, this is just the classical paraproduct, so that is where 
our operator came from. However, we do not assume here orthogonality of b to constants, 
so b here is just a collection of functions b . 

We are interested when this operator is a bounded operator LP —> L p (£ q ) (or from W — > 
L p (£ q ), if we are interested in he case p = 1). 

If each b is constant on /, the answer is given by Theorem [23] above, and it does not depend 
on p € [1, oo). In the general case, if we do not assume that the lattice is homogeneous, the 
answer generally depends on p; one can easily come up with a counterexample in the simplest 
situation when only b with / in a disjoint family are non-zero. 

Theorem 4.3. Let p € [l,oo), q € (l,oo). The operator yff, defined above is a bounded 
operator H p — > L p (£ q ) if and only if 



(4.6) sup/( Yl W J {x)\ q Y qP dx = K p < oo 



jfci, j i j eC .j CI 

Moreover, the norm, of 7F& is estimated by CK, where C = C(p). 

Proof: necessity and the easy case p < q. The necessity of the condition (j4.6|) is trivial, one 
just needs to test the boundedness of 7T& on characteristic function l /; / € C an when 
computing the L p (£ 9 )-norm only count (/} ,b corresponding to J C L. 

To prove the sufficiency, let us first fix the notation. As in the proof of Theorem (|2.4|) let 
E k :={xeX: M f(x) > 2 k } and let 8 k := {L G C : L C E k }. 
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Let (jrbf) F \c- be the coordinate projection of TTbf corresponding to the set £ k \ £ k +i 
(recall, that by (|2.2p the coordinate projection can be defined for arbitrary function in L p (£ q ), 
not just for elements of gp). 

The sufficiency for p < q is proved absolutely the same way as in Theorem 12. 4t using 
absolutely the same reasoning as there, we get the analogue of (|2.7p . namely that 



kez 



LP(£i) "' " P ' 



(this inequality holds for all p,q £ (1, oo) with C = C(p, q)). 

This immediately gives the desired estimate if p < q the same way it was done in the proof 
of Theorem 12. 4t we really did not use the fact that a T were constants there, the estimate 
works for arbitrary functions. □ 

4.2.1. Stopping moments and the hard estimate in Theorem \4-3\ To treat the estimate in the 
situation when p > q we employ the stopping moment technique. 

First of all let us note that if p > q > 1, then p > 1, so the space W is isomorphic to L p . 
That means that without loss of generality we can assume / > 0, which we will do in what 
follows (note, that we cannot do that for Ti 1 ). 

So, let us assume / > 0, / £ LP. Fix some k$ £ Z (we later let ko — > — oo) and define 
the first generation Q\ of stopping intervals to be the collection of maximal (by inclusion) 
intervals / € £k (the sets £/. and E^ were defined above in the beginning of the proof of 
Theorem 14. 3p . 

We then construct the generations Qt of stopping intervals by induction, by taking for 
each J G Q\ some disjoint subintervals I £ C, I C J to get Q^ +v Namely, suppose we have 
generation Q% of stopping moments. 

Let 

G k = |J /, 

and denote Qy. = {I £ C : I C Gk} (so Q% is the collection of all maximal intervals in Qk). 
Also, for J £ £ let 

r(J) := max{/c £ Z : J C -E^} = max{/c £7L : J £ £/%}. 

For an interval J £ Qt we consider all maximal subintervals I £ C, I C J n E r+ 2, where 
r = r(J). The collection of such intervals constructed for all J £ Q^. is the generation Ql +1 
of stopping moments. 

It is easy to establish the following properties of stopping moments. 

Lemma 4.4. For any J £ Q^ 

(i) (f)j < y^ 1 ; 

(ii) (/) J > 2 r ( J ); 

(iii) For any I £ Q k \ Gk+i, I C J the estimate (f) T < 2 r(J)+2 holds; 
(iv) Finally 

\JnG k+l \ < -\j\ 

Proof. Property (i) holds because by the definition of r(J), we have J $£ E T rn + i. Property 
(ii) holds because by the construction J is a maximal subinterval of some Ej , and therefore it 
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is a maximal subinterval of E T rn (recall that r( J) > j). But since every E r can be represented 
as E r = U{J e C : (f)j > 2 r '} and / > 0, the inequality 

(4-7) {f) j > 2 r 

holds for all maximal sub intervals J of E r . Here we used the trivial fact that for a non- 
negative function the average over a union of sets is at least the minimum of averages over 
each set. 

Property (iii) holds because by the construction of Gl +1 for any such I we have / <f_ E r ^ +2 - 
Finally, any / E Gk+i' ^ C J is, by the construction, a maximal subinterval of E r n\ +2 , so 
by (|4.7p with r(J) + 2 instead of r, (/). > 2 r ' J > +2 . Since J n G^+i is a disjoint union of such 
intervals /, we get using condition (i) 

2 r(J)+1 | J\ > [ fdx> [ fdx > 2 r ( J ) +2 | J n Gfc+il 

J J JjnG k+1 

(we used the above estimate (/). > 2 r+2 for the last inequality), which gives us property 
(iv). " ' ' D 

We are now ready to prove the estimate. For an interval J G Ql let 

G(J):={IeC:ICJ,I^g k+1 } 
(which is exactly the collection of intervals / from the property (iii) of generations Q^ ) . Then 

oo 

&bf) £ = (nf) Ql = E E ^f) g(J y 

fc=l JG0* 

and each (?f&/) c , « is supported on J. So, taking g G LP , \\g\\ p > < 1 we can estimate 
(4.8) f \\(nf) Fi (x, 0IUs(*<EE / \\^ b f) c(J Jx,-)\Ug(x)\dx 



fc=UGg,* ,yj 






Each integral in the sum (over J G Qt) can be split 

...+ / ... = A(J) + B(J). 

J\Gk+i J Jr\Gk+i 

Let us estimate A(J). For J G Qji let us denote J := J \ Gfc+i- Note that the sets J are 
disjoint and U fc >i Uj ee * J = E ko . 

By the property (iii) of generations £/£, the inequality (/). < 2 T ( J > +2 holds for / G <7(J). 
Together with (|4.6p this gives us the estimate 

(4-9) ll(n/) g(/) || LP ^ ) <2 r ( J ) +2 |J| 1/p , 

so 

(4.10) A(J) < 4 • 2 r ( J )|J| 1/p ||5l~ll P ' < 2 1/p 4 • 2 r ( J )|J| 1 / p || 5 l^|| p /; 

the last inequality holds because by property (iv) of generations \J\ < 2|J| (recall that 

J = J \ Gk+i)- 

We know that by definition of r( J) 

2 r(J) <minM / ,/(x), 
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so using this inequality and disjointness of Js we get 



oo oo „ 

(4.11) J2 E 2r(J) ' P l J l ^ 2 E E 2V{J) ' P \J\ < 2 [M c f(x)] p dx < C 

k=i JeGT fc=i J&Qt x 



Again, since Js are disjoint, 



oo 

E E \\9ht' ^ w<- 

fc=i Jeg* 



So, applying Holder inequality to (|4,10p we get 

oo 

EE^^II/MMIp'- 

fc=i jee* 

To estimate the sum of B(J)s, let us notice that for J £ Q^ the function (j^bf) G(J) is 

constant on intervals / € ^ +1 , / C J, so the integral -B(J) does not change if we replace g 
there by the function g , , 

9 j ■= E ^J 1 !- 



Since 



we can estimate using (J4.9 



\9jK>= e iwvi 

ieg* k+v icJ 



W 

(4.12) 73(J) < 2 r ( J )+V| 1/p ( E \(9}j\ P '\I\" 

K ieg* k+1 ,ic.J 

The measure X^i S JeS* 1j^ x i s clearly Carleson, i.e. for any I £ C 

oo 

(4.i3) J2 E ^ c i J i- 

fc=lJ G g*:JC/ 

Indeed, since interval in each generation Q£ are disjoint, the desired inequality holds trivially 
if we consider only one generation in the sum, for example the generation Q^, with the smallest 
possible k, still intersecting /. By property (iv) of generations, the contribution of each next 
generation is at most half of the previous, so summing geometric series we get (|4.13p . 
Therefore, by the Carleson Embedding theorem (cf. Theorem 12.41 for p = q) we get that 

oo 

E E Mrfmzcwgw* 

fe=i /ee* +1 

Therefore, summing (|4.12j) over all J, then applying Holder inequality and using (|4.1ip . we 
get 

oo 

J2Y. B ( J ^ c \\fU9\\ P i- 

k=i Jeg; 



Ek 
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Combining this with the estimate for the sum of A(J)s, we get from (|4.8p 

(jtbf) £h {x, -)\\ eq \g(x)\dx < C\\f\\ p \\g\\ p >. 

Letting ko — > — oo concludes the proof. □ 

4.3. Bounds for paraproducts. We will need the following simple lemma. 

Lemma 4.5. Let I be a disjoint union of sets I\ and I2, and let h be a "Haar function" , i.e. 
h = ail, +02lr and J x hdx = 0. Then, assuming without loss of generality that \I\\ < /2I 
we get that for p G [1, 00) 

\\h\\P < oil/? 1 ll p 

and that the inverse Holder inequality holds 



lpl \h\\ P ' < 2\\h\\l l/ P +l/p' = l. 

Proof of Lemma \4-5\ Assume without loss of generality that |/i| < \l2\- The condition 
J x hdx = means that 

ai|il| = — CK2 1-^2 1 , 

which immediately implies |«2| > |cti | . Then 

/ \h\ p dx = |ai| p |/i| + |a 2 | p |/2| = \ui\ p \h\ + |a2| p-1 |aill J il < 2 \ a i\ P \h\ 
Jx 



so 



\\ p P <nhij\\ p P 



and similarly for p' . 

Since for constant functions Holder inequality becomes identity, using the above estimate 
we can write 



||p/ < 2||/il / ||p||/il / || p / = 2||/il 7 |[2 < 2\\h\\l 



D 



Theorem 4.6. Let b = {A,6}, „ be a martingale difference sequence, and let p € [1, oo), 
q € (1, 00). Then 

(i) The paraproduct iTb is a bounded operator from W to Tiq if and only if 

r — 

(4.14) sup-N Yl \&jKx)\ q y P dx=:KP<oo. 

Moreover 

K<\\n b \\~ <CK, 

where C = C(p, q). 
(ii) The paraproduct ir b * is a bounded operator in W = Ti^ if and only if b € BAiO = 
BM0 2 - Moreover 



1 



q\\"1\BMO - IIV \\fr->w> - C W b \\BMO 



where C = C(p). 
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Remark 4.7. For q = 2 statement (i) of the theorem describes the boundedness of the para- 
product ir b in W (or equivalently, in H p ). For p G (1, oo) this is equivalent to the boundedness 
of n b in LP . 

Note, that unlike the condition b G BMO, which is necessary and sufficient for the bound- 
edness of 7^ in ah 7~L P , the above condition (|4.14p (for q = 2, for example) does depend on 
V- 

Remark 4.8. Note that the condition (|4.14p (for p = q = 2) is weaker then the condition 
b G BMO. Since by Proposition 11.51 

M b = 7T b + 7T* b +A b + R b , 

and X b , R b commute with all martingale multipliers, the above theorem implies, in particular, 
that unlike the homogeneous case, it is impossible in general to characterize b G BMO via 
boundedness of the commutators of M b with martingale multipliers. 

Proof of Theorem \4-6\ The statement (i) is easy. The "only if" part and the estimate K < 
\\7r b \\ follow from testing the boundedness of ir b on functions 1,, / G C. The "if" part with 
the estimate ||7rf>|| < CK follow from Theorem 14.31 above. 
Let us prove statement (ii). Notice that by Proposition 11.51 

If b G BMO, we know that for any p' G (1, oo) 

r I ' 

sup-f( Y, \^jKx)\ 2 y P dx<C\\b\\V BM0 . 
iecJi JeC . JcI 

Taking p' to be the dual exponent to p, 1/p + \p' = 1, we get that by (i) n b is bounded in 
W , so by duality tx\ is bounded in H p . 
Since by Proposition 11.51 

(4.i5) k\f = J2 A i [( v)( A //)] = E( A / 6 )( A //) - Y, E A A M A if» 

iec i&c iec 

and by the definition of BMO we have ||A / 6|| 00 < ||6||bxo, we can conclude that A^ is 
bounded in W . Indeed, since HA^ftHoo < ||6||,b.m0 



f ^|(A / 6)(A / /)| p dx < \\b\\ p BMO f Y,\ A if\ Pdx 



\bMOUJ W-fjp- 

leC ,,A /e£ 



By Fefferman-Stein maximal theorem we get from this inequality 



^|E / ((A / 6)(A / /))| p dx<C7||6||] 



\BMo\\f\\-flp- 
iec 



So both sums in (|4.15|) can be estimated and we get that A^ is bounded in T-L p . 



Assume now that ir b is bounded in W p , so 



( 7r W)* =7r6 + (A i r = 7r6 + A i 

is bounded in T~L V . Testing this operator on functions l f and counting in the result only 
martingale differences with J C /, we get 



(4.16) sup-ff Yl \&jKx)?Y P dx<K p ' <oo, K 

I £LL J I 1 r~ P . 7 r~ T 



b ) \\W-W 
J&C-.JCI 
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By (i) this means that ir^ is bounded in 1~L P (with the norm at most CK), and so Aj is also 
bounded in W with the norm at most C\K. By duality, A^ is bounded in T~L P (with the 
same norm). 

Estimate (jiTTBj) also implies that ||A 7 6|| p / < K\I\ x / p ' . 

To prove that b € BAiO q ' it only remains to show that for all I € C 

||A J 6||oo < CK. 

Assume that ||A 7 6||oo > 2 1 ' P K, because otherwise we already have the desired estimate. 
Let J € child(I) be an interval where ||A ,&[|oo is attained. Then 

2K P \J\ < \\Ajb\UJ\ < ||A 7 6||^|/|, 

so \J\ < \I\/2. 

Define a test function h 

h:=lj-al IV , a = \J\/(\L\-\J\)<l, 

so f x hdx = 0. Since \J\ <\L\ J\, Lemma H~5l implies that 

\\h\\P < 2111 ll p — 21 71 

For out test function A h = h is the only non-zero martingale difference, so it follows from 
(^T5l) that 

WhAjbWpK ||Ajfc||p + ||E J (fcA J 6)|| p . 



We can estimate 
On the other hand, 



||/iA / &|| p >||l J / l A 7 6|| p = |J| 1 /P||A / 5|| 00 . 
\h\h\\ p <C l K\\h\\ p = C l K{2\J\) 1 l p 



and 



|E / (/iA / 6)|| p < \I\ 1/p i \hAjb\da 

< \I\ 1 / p - 1 \\A, 



'■ I \\p'\\i"\\p 

< \I\ 1 / p - 1 K\I\ 1 / p '{2\J\) 1 / p = 2 l ' p K\J\ l l p 
Combining all together we get that 

|J| 1 / p ||A / 6|| 00 < 2 l ' p CiK\J\ l ' p + 2 l l p K\J\ l l p , 
so\\A I b\\ 00 <CK,C = 2 1 ' p {C 1 + l). D 

5. BOUNDEDNESS OF COMMUTATORS IN LP 

5.1. Sufficiency. We start with a simple proposition. 

Proposition 5.1. Let p € (l,oo), and let T be a bounded in L p (equivalently in H^) mar- 
tingale transform. Let b be a locally integrable function. 

If the formal sum bo := J2ieC ^i^ ^ s ^ n $MO, the the commutator [M&, T] = M b T — TM b 
is bounded in LP (equivalently in H p ). Moreover, 

\\[M b ,T]\\ LP ^ LP < C||T|| L p_> LP ||&o|| BA/(0 , 
where C = C(p). 
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Remark. Note, that the case p = 1 is not included here. While the condition bo G BMO is 
necessary and sufficient for the boundedness of the paraproduct ir b in H 1 , this condition is 
not sufficient for the boundedness of the adjoint irf there, even in the simplest case of the 
standard dyadic grid. 

This can be easily seen by going to the dual space and noticing that the condition b € BMO 
is not sufficient for the boundedness of the paraproduct n b in BMO (we are considering the 
simplest case of the standard dyadic grid on M here, so all BMO spaces coincide). Since 
the condition / S BMO does not imply any bounds on the averages (f) T , on can take an 
unbounded function / € BMO (so the averages (/). are not uniformly bounded) and easily 
construct a function b € BMO such that ir b f ^ BMO. 

Proof of Proposition I5.il By Proposition 11.51 

M b = 4* ] +A° b +n b + R b . 

Operator A® commutes with all martingale transforms, so we can exclude it from the com- 
mutator. Since TR b = R b T = 0, we can exclude R b as well, so 

[M b ,T] = [ir b + 4*\T]. 

(*) 
Therefore, if TT b and tt^ are bounded, the commutator is bounded as well. But according to 

Theorem 14.61 the condition b G BMO implies the boundedness of both paraproducts n b and 

4' 



7iv*' (for q = 2 condition (i) of Theorem 14.61 follows from condition (ii) there). □ 



It will be shown later that in the case when Slj^ n C = and the martingale transform T 
has the right "mixing" properties, the condition Yli<=c ^j^ ^ BMO is also necessary for the 
boundedness of the commutator. If 2lJ^n£ ^ 0, the sufficient condition J2ieC ^i^ e BMO 
can be relaxed a little. As it will be shown below in Section [52] this relaxed condition is also 
necessary (again if the martingale transform T has the right "mixing" properties). 

5.2. Necessity. We want to state and prove an inverse (at least partial) to the above Propo- 
sition ETJ Of course, to prove such a theorem one needs to make some additional assumptions 
about the martingale transform T (for example, identity is a martingale transform, and it 
commutes with everything). 

Definition 5.2. Let T be a martingale transform. Following S. Janson [8], we say that an 
interval / € C with parent /' is (p, e, K) non-degenerate for T if there exists h = h , E D = 
A L 2 , such that 

(i) \\h\\ p = 1, 
(ii) h\ I = 0, 
(hi) ||l / T / ,/i|| p >e, 

(iv) ||/i||oo < AT|iT 1/p ^ I is "small", namely if |/| < \I'\/K. 
The last condition (iv) means that for "small" intervals / the function h has to be "spread" 
on the interval /'. 

If we skip condition (iv), we get the definition of (p, e) non-degenerate interval. 

We say that the martingale transform T is weakly (p, e, K) mixing if each interval / with 
a parent is either (p,e,K) non-degenerate for T or (p',e,K) non-degenerate for the adjoint 

We say that the martingale transform T is strongly (p, e, K) mixing if each interval / with 
a parent is (p, e, K) non-degenerate. 
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Using the notion of (p, e) non-degenerate intervals, one can define weakly and strongly 
(p, e) mixing martingale transforms. 

Remark. The above definition of weakly (p, e) mixing martingale transform is essentially a 
restatement (and a generalization) of the definition of a non-degenerate transform from [8] . 
It was given there for case of the uniform r-adic lattice, with all operators T, being equal 
(after canonical identification of all subspaces D, ) . 

For the case considered in |Hj, our definition coincides with one given there. The easiest 
way to see this equivalence is to look directly at the proof of Theorem 2 in [8J (at least that 
was the easiest way for me). 

Note also, that for homogeneous lattices the norms ||/||p|^| _1 ' p on D, are all equivalent. 
That means that any (weakly or strongly) (p, e) mixing martingale transform is also (p, e, K) 
mixing (resp. weakly or strongly) with appropriate K. It also mean that any (p, e) mixing 
martingale transform is also (r, e 1 ) mixing with appropriate e 1 . 

Recall that we defined the formal sum bo = Yliec ^ r b. Define also the formal sum 

(5.1) 6 := Y, A i b 

/eA21 0,fin 

(note that b = b if 21°^ n C = 0). 

Theorem 5.3. Let p G (l,oo) and letT be a strongly (p,e,K) mixing martingale transform, 
such that its blocks T, are uniformly bounded in L P Q 

If the commutator [T, Mb] is bounded in L p , then bo € BM.O = BM.O2; 

Moreover, for p = 2 it is sufficient to assume that T weakly (2, £,K) mixing martingale 
transform. 

Finally, the norm \\bo\\ BMO can be estimated by a constant depending onp, \\[Mb,T]\\ p , 
supj S £ H^Vll an d s, K from Definition \5.$\ . 

Proposition 5.4. Let p € (1, 00) and let T be a (possibly unbounded) strongly (p,e) mixing 
martingale transform (weakly (p, e) mixing for p = 2). 

If the commutator [Mb, T] is bounded in n^, then for any interval I G C\ SlJ^ a uniform 
estimate, 

(5.2) -ff Y, \ A j b \ 2 ) dx<C <oo, C 1/p = Ci||[M 6 ,T]||/e. 

where C\ = C\{p), holds. 

Proof of Proposition \5.4\ The proof directly follows [8j. For an interval /, let I' be its parent, 



so I € child (i 7 ). We know that that / is (p, e) non-degenerate. Let h = h , G D , be the 
function from Definition 15.21 such that \\h\\ p = 1, h\j = and ||1,T, /i|| p > e. 

Note that \\h\\ p = \\h\\ H P. 

Recall that the function Th = T ' ,h is constant on /, and let c be its value there. The 
inequality ||l.T,/i|| p > e means that \c\ > e\I\~ l > p . 

We get that on I 

M b Th = cb. 



Note, that for p/2 this condition is weaker than boundedness of T in L p . 
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On the other hand, bh = on /, so (Th)\i is a constant, so for J C I 

Aj ((M b T - TM b )h) = cAjh. 

o 

The fact that M b T — TM b is bounded in ffi implies that 



2 
,1/2 

£ A,, 

JeCJci 



<C, C= If [M b ,T] ||, 



p 



so taking into account that |c| > e|/|~ 1 ' p we get the conclusion of the proposition. 

For p = 2, we can assume that T is weakly (p, e) mixing, because if / is {p, e) non-degenerate 
for T' , we can consider the adjoint of the commutator, to get the same conclusion. This would 
not work for p ^ 2, because in this case we get the estimate with the exponent p' instead of 

p. □ 

5.2.1. Proof of Theorem 1 5. 'A To prove the theorem we need to show that ||A 7 6||oo are uni- 
formly bounded for all / G C \ 2l_'o^. 

Consider an interval (let us call it /') belonging to C \ SlJ^. Notice that inequality 
(J5T2|) implies that ||A ,6|| p < C 1/p \I'\ 1/p < oo. Assume that M := ||A ^Hoo is attained on 

/ G child(/')- 

We can assume that \I\ < \I'\/K, because otherwise 

ni.A^fcii^ = iir^A^ < m-'w^Mi < \i\- l c\r\ < kc. 

Define 

g = l I - 1 l nv 

where the constant 7 is chosen so f^gdx = 0. Let E C Dp be the annihilator of g in Dp 



J gfdx = o}. 



E=ifeD P : gfdx 

L Jx 

Note that E consist of all functions / G Dp supported outside of /. Indeed, any such function 
annihilates g, and counting dimensions, we can conclude that we got all the functions in the 
annihilator. 

Such structure of E implies that 

[ fgdx = V/eB, 
Jx 

so E is the orthogonal complement of g in Dp. Therefore, Dp can be decomposed into the 

direct sum of spanjg} and E. 

We can decompose 

(5.3) (A Il b)T I ,h = ag + f + E I ,[(A I ,b)(T r h)], f G E. 

By the assumption (hi) about h, \T ,h\ > el/l" 1 ^ on /. Therefore, since / ,= and g ,= 1, 
we get from (|5.3p by restricting it to / and comparing L p norms (divided by |/| 1 ' p ), that 

(5.4) elll-VPHA^&lloo < \a\ + \\E I ,[(A Il b)(T Jt h)]\\oo 



<H + |/r 1 ii^ J ,fcii P 'iir 7/ i 



ip 



< |«| + C|7 / |- 1 ||A 7 ,6||^. 
So, to estimate ||A w6||cx) we need to estimate both terms in the right side of ([5 
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We get the bound on \a\ from the boundedness of the commutator. Namely, since 

b\ I ,= E I ,b + A I ,b+ Yl A J ,6=:E /; 6 + A /; 6 + 6 / ' 
j&l-.j^i 1 

and b 1 _L D , b 1 D _L D , we can write 

(M b Th,g) = (M b T r h,g) = ((A Jt b)T r h,g) + ((E^T^g), 
(TM b h,g) = {T r M b h,g) = (T p (\,b)h,g) + (T^E^g). 

Here we slightly abusing the notation by treating T , as the operator on all L p , i.e. as a 
martingale transform whose only non-zero block is T , (we need to do that because (A T ,b)h 
does not generally belongs to D ,). In this context T , (A ,b)h = T , [(A ,b)h — E , ((A ,b)h)] , 
where T in the right side can be treated as a block acting in D . 

Using the fact that (E ,b)T ,h = T ,(E ,b)h we conclude, again abusing the notation as 
above, that for the commutator [M b , T] = M b T — TM b 

(5.5) ([M b ,T]h,g) = {{A It b)T r h,g) - (T^A^Kg). 

We get from flO} that 



(5.6) 



((A v b)T v h,g) 



\a\ ■ \\gW2 > M • \I\- 



By Lemma 14.51 ||g|| p ' < 2 1 ' p ||1t|| p ' = 2 1 ' p |/| 1 ' p . Using this estimate and the assumption 

i|U < K\r\- l ' p , we get 



{T^A^Kg) 



< \\T r . 



\h\\oo\\A If b\\p\\g\\p> 



< CK\l'\- 1/p C\l'\ 1/p 2 1/p \I\ 1/p ' < C\I\ 1/p '. 
Using the above estimate together with the estimate 



\([M b ,T]h,g)\<C\ 
we get from (|5.5p and (|5.6[) that 



\p\\y \\p 



, <C-1- l/l 1 ^' 



Id • HI < 



((A 7 ,6)T 7 , /*,<?) 



^{[M^T^g^ + W^A^g) 

< C\I\ 1/p ' + C\I\ 1/p ' = C\I\ 1/p \ 



so 



|a| < C\I\- 1/p . 
Combining the last inequality with (j5.4|) we get 



(5.7) 



IA^&Hoo < j — ' — M + c l -!—\\A I M\ pl \r 



'i-i 



so 



<C + C|J| 1 /'|J'|- 1 ||A J# 6||p/ 
If p' < p, Holder inequality implies that 

i/r 1/p 'iiA // 6n p/ <|/r l/p iiA / ,6ii P <c 

II A / ,6||oo < C + C\I\ 1/p \I'\- 1/p < C. 
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If p' > p, Lemma 15.51 below implies that 

\\A I M\ P '<\\A I M\l /p '\\A I ,b\\ 1 ^ p/p ' 

and we get from (J5.7J) 

(5-8) \\A r b\\oo < C7 + C7|/| 1 /P|l'|-i||A / ,5||^'||A // 6||L- p/p ' 

< C + C\I\ 1 / p \I'\- 1 / p \\A I ,b\\ 1 ^ p/p \ 
the last inequality being true because 

|I / |- 1 /i'||A // 6||g/ p ' = (\I'\- 1/p \\A If b\\ P ) P/P ' < C p l p ' < C. 
Since |/| < \I'\, (|5.8p implies 

II A h\\ <r , -i_r , HA h\\ 1 ~p/p' 

\\l±ji°\\oo 2i t> t- o n^i^oiioQ , 
which gives us a bound ||A r ,b\\oo < C . □ 

Lemma 5.5. Let f be a bounded measurable function on a measure space X. Then for any 

11/11, < \\f\\ P P /q \\f\\^ p/q 
Proof. 

\\f\\ q q = f \f\ q dn = f \f\ p \f\ q ~ p dn < \\fW£ p f \f\ p d^ = \\f\\ q - p \\f\\ P P 
Jx Jx Jx 

and raising this inequality to the power 1/q we get the conclusion of the lemma. □ 

5.3. Relaxing sufficient condition. If C n SL^ = 0, we have bo = bo, so bo € BM.O is a 
necessary and sufficient condition for the boundedness of the commutator [M&, T] (provided 
that T satisfies assumptions of Theorem 15. 3p . 



If C n 2t_!o^ 7^ there is a gap between necessary and sufficient conditions. Notice, that 
the situation £ n SljJ^ 7^ is not an exotic one. For example, it happens in the classical 
martingale situation, which in our notation mean that Ck = £0 = X f° r an k < 0, |X| = 1. 

To bridge the gap between necessary and sufficient conditions in the case C n SlJ^ 7^ 0, 
we can relax sufficient conditions in Proposition 15.11 

Proposition 5.6. Let b be a locally integrable function, and let T be a bounded in L p mar- 
tingale transform. Assume that 

(i) 60 € BM.O, where bo is defined by (|5.ip . - 
(ii) For any I £ £n 21^ 

\\TjAjb\lp < CiUljUp = Ci|i"| 1/p , ||T;A 7 6|| p , < CxWlfWp, = Ci\I\W; 

(iii) For any I GCH 21^ 

\\[( A l)r T i]\\ LP -, LP <C2<oo; 

here (A^), is the restriction of A^ on Dj. 
Then the commutator [Mb,T] is bounded in LP , and 

||[M b ,T]|| L ^ LP <C7(||r|| LP ^ LP ||5 || e _ MO +C7 1 +C7 2 ), 

where C = C(p) and C\ C<i are the constants from (ii), (iii). 
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The proof of the theorem is obvious, since for any / E C n 2l_'o^ the conditions (ii), (iii) 
are necessary and sufficient for the boundedness of the commutator [M, _r , T] in LP . The 
necessity here is quite easy: condition (ii) is obtained by testing the commutator [M & _^ , T] 
and its adjoint on the function 1,. To get the condition (iii) one needs to restrict everything 
to the subspace Dj . 

Remark 5.7. As it follows from the above discussion, it T satisfies assumptions of Theorem 
15.31 then conditions (i)-(iii) of Proposition [5T6l are necessary and sufficient for the boundedness 
of the commutator [Mf,,T] in L p . 

5.4. Some examples and counterexamples. In this subsection we present examples which 
will show us that 

(i) Boundedness of the commutator [Af&, T] does not imply any bounds on A,h for 

/€£n2l°^; 
(ii) If the martingale transform T is only strongly (p, e) mixing (not strongly (p, e, K) 
mixing), then the boundedness of the commutator [A/j,, T] does not imply any bounds 
on || A. 6| |oo, I € C. That means that the new condition (iv) in Definition 15.21 is 
essential and cannot be skipped. 

The main building block of our construction will be as follows. Let an interval I be divided 
into 2 subintervals I 1,2 , I/ 1 !/)/ 2 ! = 6 > 0. Divide I 1 into 4 equal intervals Ik, 1 < k < 4 and 
I 2 into 4 equal intervals Ik, 5 < k < 8. 

The intervals Ik will be the children of /. Define the "Haar functions" h , = hj € D f 

h k :=l T -1. , KJfe<4. 

Note, that the functions h do not span the martingale difference subspace D,. Define also 
a "Haar function" h = h T G D,, h = 1 x — 51 2 . 
On Dj define a block T, , 



T I h 1 = h 2 , T I h 2 = h 1 , 

Tjh 3 = h A , Tjh 4 = h 3 , Tj 



= 0. 

span{/i fc :l<fc<4} x 



If A, 6 = ah T , then the block (A^), of A^ commutes with T,. This together with the fact 

that TMj = implies that if / G C n StJoo and the block T, of a martingale transform T is 
as described above, then multiplication operator M/ l/ commutes with T 

So, if we add to b any multiple of hj , we will not be able to detect it by looking at the 
commutator [Mb,T], which gives a example for the statement (i) above. 

To give an example to statement (ii), take a finite interval Iq =: X, divide it into 8 
subintervals, as it was described above (with 8 = 5\) to get the "children" of Iq, then divide 
each child into 8 parts, and so on. We assume that on each step we take 5 = 5 n , 5 n — > as 
n — )• oo. That will be our lattice C 

Let T be a martingale transform on C, where each block T, is as described above. Notice, 
that T is strongly (p, e) mixing (but not strongly (p,e,K) mixing). Notice also, that clearly 
T is bounded in L 2 . 

Take an interval I G C\ 21°!^ = £\{I }. 

Take p = 2 and define h = h, = 5^ 1 ' 2 h, where h = h, is the "Haar function" defined 

above, h = 1 . — 61 r0 . 
I 1 i z 
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By Lemma 14.5} \\h\\2 < 2 1 ' 2 |I| 1 ' 2 . On the other hand, ||/i||oo = <5 _1 , so we can pick / 
such that 1 1 /i | |oo is as large as we want. 

Note that for b = h, the martingale transform T commutes with A^ (and so with A b ), so 
it is easy to check that the paraproducts -Kb, tt^ and so the commutator [M&, T] are bounded. 
However, as we discussed above, ||6||oo = S~ 1 ' 2 . 

So, if we consider a collection C of disjoint intervals in £ \ SP^ with <5 — >■ 0, and define 

b = E~ h i 

lee 
then the commutator [Mfe,T] is bounded. That can be seen, for example, by noticing that 
A^ commutes with T (one needs to treat each block separately, which reduces it to the case 
b = h T ), and the paraproducts 7T{, and tt^ are "direct sums" of the paraproducts with b = h,, 
treated above. 

So we constructed an example of b and a strongly (p, e) mixing martingale transform T 
such that the commutator [Mb,T] is bounded in L 2 , but sup Je £ [|AJ|oo = oo. 

An easy modification allows also to get an example for IP . 
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